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ABSTRACT 

The molecular network in an organism consists of tran- 
scription/translation regulation, protein-protein interac- 
tions/modifications and a metabolic network, together form- 
ing a system that allows the cell to respond sensibly to the 
multiple signal molecules that exist in its environment. A key 
part of this overall system of molecular regulation is there- 
fore the interface between the genetic and the metabolic net- 
work. A motif that occurs very often at this interface is a 
negative feedback loop used to regulate the level of the signal 
molecules. In this work we use mathematical models to inves- 
tigate the steady state and dynamical behaviour of different 
negative feedback loops. We show, in particular, that feed- 
back loops where the signal molecule does not cause the dis- 
sociation of the transcription factor from the DNA respond 
faster than loops where the molecule acts by sequestering 
transcription factors off the DNA. We use three examples, the 
bet, mer and lac systems in E. coli, to illustrate the behaviour 
of such feedback loops. 



INTRODUCTION 

Gene expression in bacterial cells is modulated to enhance the 
cell's performance in changing environmental conditions. To this 
end, transcription regulatory networks continuously sense a set of 
signals and perform computations to adjust the gene expression 
profile of the cell. A subset of such signals contains molecules 
that the cell can metabolize. These molecules range from nutri- 
ents to toxic compounds. A commonly occurring motif in the net- 
works sensing such signal molecules is a negative feedback loop. 
In this motif an enzyme used to metabolize the signal molecule 
is controlled by a regulator whose action, in turn, is regulated by 
the same signal molecule. This motif allows for genes that are not 
transcription factors to negatively regulate their own synthesis. 

Because these negative feedback loops are situated at the in- 
terface of genetic (1! 2 J and metabolic (|2||3} networks, under- 
standing their behavior is crucial for building integrated network 
models, as well as synthetic gene circuits (|4]|5]|6j. In fact, if one 
ignores the interface, the network topology gives the impression 
that feed back mechanisms are less frequent than feed forward 
loops Cl|8j. In addition, by ignoring feedback associated to sig- 
nal molecules one would also tend to overemphasize the modular 
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Figure 1: Schematic diagrams of four types of negative feedback 
loops containing proteins and signal molecules. Regulation of the pro- 
moter (arrow) transcribing the enzyme (E) that can metabolize the signal 
molecule (s) is shown in the first two columns. R represents the regula- 
tor. A network representation of each regulation mechanism is shown in 
the rightmost column. This is not a complete logical list of all combi- 
natorial possibilities, it includes only the ones where real examples have 
been found. 



features of the overall system Q and underemphasize the average 
number of incoming links to proteins. 

Even within the framework of a negative feedback loop there 
are several different mechanisms possible both for transcriptional 
regulation and for the action of the signal molecule. We list 
below four mechanisms which are present in living cells, with 
examples taken from E. coli (Fig. 0; 

(i) The regulator, R, represses the transcription of the enzyme, E, 
which metabolizes the signal molecule, s. The signal molecule 
binds to the repressor resulting in the dissociation of the R- 
operator complex and an increase in the production of E. This 
mechanism is exemplified by a negative feedback loop in the lac 
system (I10> . where the roles of R, E and s are played by Lad, 
/3-galactosidase, and lactose, respectively. 

(ii) R represses the transcription of E which metabolizes s. But 
here the signal molecule can bind to R even when it is at the 
operator site. When this happens the effect of R on the promoter 
activity is cancelled, or even reversed. Two examples of this kind 
are the bet ( I11II12> and mer ( I13> systems, which are involved in 
the response of cells to the harmful conditions of osmotic stress 
and presence of mercury ions, respectively. 
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(iii) Here the regulator, R, is an activator of the transcription of 
E when s is bound to it. Without the signal molecule, R cannot 
bind to the DNA site and activate transcription. For instance, 
MalT in complex with maltose is a transcriptional activator of 
genes which metabolize maltose. This mechanism differs from 
(ii) in that in the absence of s, R is a repressor in (ii) while here 
it does not affect the promoter activity. 

(iv) Here too, R alone cannot bind to the operator site. However, 
in contrast to (iii), R bound to s represses the transcription of 
E. Further, in this case E increases the production of the signal 
molecule, rather than metabolizing it, thereby again making 
the overall feedback negative. One such example is the regu- 
lation of de novo purine nucleotide biosynthesis by PurR ( fT4l[T5t . 

A major difference between these four loops is the manner in 
which the signal molecule acts. In (i) the binding of s to R dras- 
tically reduces its affinity to the DNA site. On the other hand, in 
(ii), (iii) and (iv), the signal molecule increases, or does not sig- 
nificantly alter, the binding affinity of R and can also affect the 
action of the regulator when it is bound to the DNA. Henceforth 
we will refer to these two methods of action as 'mechanism (1)' 
and 'mechanism (2)'. 

In this paper we have investigated how this difference in the 
mechanism of action of the signal molecule translates to differ- 
ences in the steady state and dynamical behaviour of the simplest 
kind of negative feedback loops containing proteins and signal 
molecules. These loops have only one step, E, between the reg- 
ulator and the signal molecule. Further, the regulator is assumed 
to have only one binding site on the DNA. We concentrate on the 
cases where R is a repressor and s lifts the repression (i and ii). In 
particular, we show that the two mechanisms differ substantially 
in their dynamic behaviour when R is large enough to fully re- 
press the promoter of E in the absence of s. We illustrate how the 
difference is used in cells by the examples of the bet, mer and lac 
systems. 



RESULTS 

Steady state behaviour of feedback loops 



First we consider how the steady state activity of the promoter of 
E responds to changes in the concentration of s for each of the 
mechanisms. 

Consider a feedback loop, like Fig. ^ (i), where the opera- 
tor can be found in one of two states: free, Ofico, and bound to 
the regulator, {RO), with the total concentration of operator sites 
being a constant: Otot = Ofree + (RO). We assume that the 
promoter is active only when the operator is free, and completely 
repressed when it is bound by R. This loop uses mechanism (1) 
and is an idealization of the lac system in E. coli. The promoter 
activity is given by: 



In steady state, Ofrcc and {RO) can be expressed as functions of 
the total concentration of regulators, i?tot, and the concentration 
of signal molecules, s. The expression also contains the parame- 
ters Kfio and h (the equilibrium binding constant for R-operator 
binding and the corresponding Hill coefficient), Kj^s and hg (for 
R-s binding.) Equation|6lin the Methods section contains all the 
details. The main effect of s is to decrease the amount of free R 
because _Rtot = ^froc + (Rs), where (Rs) is the concentration 
of the R-s complex. 

For a feedback loop using mechanism (2), the operator can 
be found in one of three states: free, Ofrco, bound to the reg- 
ulator, (RO), and bound to the regulator along with the sig- 
nal molecule, (RsO). The total concentration of operator sites, 
Otot = Ofree + (RO) + (RsO), is Constant. The promoter activ- 
ity has a basal value (normalized to 1) when the operator is free. 
When the regulator alone is bound it represses the activity. We as- 
sume the activity in this state is zero. When the operator is bound 
by R along with s the activity returns to the basal level. This is 
an idealization of the bet system. Here, the promoter activity is 
given by: 

Ofree + (RsO) 



A. 



(2) 



Ai 
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free 



o 



free 



Ot, 



Oft.ce + {RO) ■ 



(1) 



Ofree + (RO) + (RsO) ■ 

The main effect of s comes from the second term in the numer- 
ator of equation 2, which is the concentration of the R-s-operator 
complex. As in the case of Ai, in steady state the activity can 
be expressed in terms of i?tot and s. Because of the third state of 
the operator, (RsO), the expression for A2 includes one more pa- 
rameter, KrsO^ the equilibirum binding constant for R-operator 
binding when s is bound to R (see equation |8] in the Methods 
section for details.) 

For mechanism (2), we mainly consider the case where 
KusO = Kro, i-e., the binding of the signal molecule does not 
change the binding affinity of the regulator to the operator This is 
the simplest situation and illustrates the basic differences between 
the two mechanisms. In real systems these binding constants 
are often different. However, as we show, for bet and mer the 
inequality of K^sO and Kno does not obscure the differences 
caused by the two mechanisms of action of the signal molecule. 
This is because the main effect of changing KbsO is simply to 
shift the position of the response curve. Only when K^sO be- 
comes very large (which results in dissociation of R from the 
operator when s binds to it, as in lac) does mechanism (2) effec- 
tively reduce to mechanism (1). 

Figure 12] shows the activities Ai and A2 for a range of values 
of _Rtot and s. The following observations can be made from the 
figure: 

(a) For sufficiently small values of i?tot there is no difference 
between Ai and A2. 

(b) From Rtot/KRO = 1 and higher, Ai requires larger and 
larger s to rise to its maximum value, i.e., its effective bind- 
ing constant A'cff increases with i?tot (where we define K^s 
to be the value of s at which the activity is half- maximum.) 

(c) A2, on the other hand, has a KcS which is remarkably 
robust to changes in -Rtot, remaining close to Kns for 
Rtot/KRO > 1- 
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Figure 2; Promoter activities for the two meclianisms as a function of 
Rtot and s. Red: Ai (equation Q] and [6|); Blue; A2 (equation and 
[8|) Parameter values (see equations [6| and [Sj are; Kro = 1, Krs = 
100, h = 2,hs — 1 and KfcsO = 1 for mechanism (2). Choosing 
h = 2 assumes that two protein subunits are involved in the R-operator 
binding. 



(d) Zooming in to the low s region shows that A2 rises more 
steeply than Ai for small s values. 

All these features can be explained by taking a closer look at 
the equations for Ai (eq. [nand|6jl and A2 (eq. |2land|8]) Taking 
the observations in reverse order, first we see that for small values 
of s, the promoter activities rise as a power of s: A{s) « ^1(0) + 
const. X s". From equations|6land|8lwe find that this power a = 
hs for mechanism (1) and a — h x hg for mechanism (2). Thus, 
as long as h > 1, mechanism (2) will have a steeper response at 
small values of s. 

Next, let us consider the amount of inducer s ~ Kcs needed 
to half-activate the promoter under the two mechanisms. The fact 
that A'off is close to Kf>s for A2 is because of the term (RsO), 
which occurs in both the numerator and the denominator of equa- 
tion|2] When i?tot is large enough (i.e., i?tot > Kbxj), the op- 
erator is rarely free, and the constant term (=1) in eq. |8]can 
be disregarded from both numerator and denominator. In that 
case A2 only depends on the ratio between the binding affinities 
Khso/Kro- Accordingly A2 becomes independent of the value 
oiRtot for Rtot > Kro = 1. 

On the other hand, the activity Ai is always highly dependent 
on Rtot- From equation|6lwe see that Ai reaches half-maximum 
when {RO) = Oftoc- This happens when (s/Krs)'^'^ w 
{Rtot/ Kro)- Therefore Kcs is an increasing function of i?tot 
for mechanism (1). 

when i?tot drops below 
a regime where the inducer is 
For our standard parameters, 
< Kfio implies that Ofrco and 
(RO) dominate (RsO) in equation |2l Thereby the functional 
form of the activity A2 approaches that of Ai, as indeed seen 
from the Rtot/ Kro < 1 regime in Fig. 2. 

In addition to these mathematical arguments, the above obser- 
vations can be understood physically from the nature of the pro- 



For both mechanisms, 
min{KiisO,KRo) we enter 
not needed for derepression, 
repressor concentration Rtot 
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bet 
mer 
lac 


1 
1 
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0.32 
0.13 
0.06 


0.83 

13.71 
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0.29 
3.5 
1000 
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Table 1 : Values of parameters in equation 3 for three systems 
found in E. coll. In the case of lac we used a simplified case, 
where the lac promoter is repressed by Lad binding to a single 
operator, 01 . 



cesses allowed in mechanisms (1) and (2). Consider the case of a 
fully repressed promoter (when i?tot ^ Kro)- Mechanism (1) 
then requires dissociation of R from the operator for the activity 
to rise and this is associated with a free energy cost proportional 
to \ii{Rtot/KRo)- In mechanism (2) there is no such cost and 
therefore a smaller amount of s is required to achieve the same 
level of inhibition of R. Thus, for genes which are typically com- 
pletely repressed, and transcription of which, on the other hand, 
may be needed suddenly, mechanism ( 1) is inferior to mechanism 
(2) because it needs a larger amount of s. After first discussing 
three real systems, we will elaborate on this response advantage 
by comparing the explicit time dependence of the two mecha- 
nisms in the next section. 

The most general framework within which the promoter ac- 
tivities of the enzymes in the bet, mer and lac systems can be 
represented is the following generalization of equations 1 and 2: 



A = 



aObco + KRO) + -/{RsO) 
Ofrcc + {RO) + {RsO) 



(3) 



Equation|9lin the Methods section shows the dependence of A 
on i?tot and s. a, /3, 7 are constants dependent on which system 
we are trying to describe, a is the promoter activity in the absence 
of R and is used as a reference (1.00). [3 and 7 are the relative 
promoter activities in the presence of R alone, and R together 
with s, respectively. Table 1 shows the values of a, (3, 7 as well 
as how the binding affinity of R to the operator is changed by the 
binding of s (the ratio Kr^o/Kro) for the three systems. We 
have used the Hill coefficients h = 2 (assuming that two protein 
subunits are involved in DNA binding) and hg ^ 1 (for simplicity 
and to compare with Fig. |2]) Mechanism (1) and (2) are special 
cases of this equation. Equations^andl^l for mechanism (1), are 
obtained by setting a = 1, f3 = Q and taking the limit Krso -^ 
00 (the value of 7 is irrelevant in this limit). Equations|2]and|8l for 
mechanism (2), are obtained by setting a = l, [3 — 0, 7 = 1. 
From Table 1, it is clear then that lac uses mechanism (1) and 
bet uses mechanism (2). mer is an even more extreme case of 
mechanism (2) where the {RsO) term has a much larger weight 
(7 ^ 1) than the idealized mechanism (2). 

Fig. |3lshows the response curves for the bet, mer and lac sys- 
tems, bet and mer, representatives of mechanism (2), and lac, 
a representative of mechanism (1), indeed behave similar to the 
idealized versions of the two mechanisms investigated in Fig. |2l 
The difference between bet and mer is the result of changes in the 
binding affinity of R in the absence and presence of s. 

A further complication that could occur in real systems is that 
the probabilities of RNA polymerase recruitment could be differ- 




Figure 3; Promoter activity. A, (see equation and 01 versus s 
for the bet (blue), mer (purple), lac (red) systems in E. coll. Be- 
cause the promoter activities for the three systems have different min- 
imum and maximum values, for easier comparison we have plotted 

{A - ^min)/(^max - ^min), where Aynin = A{s = 0) and Amax = 

A{s — cxj). /;] all cases, we kept Rtot/Kao ~ 10. 



ent for different states of the operator. We find that taking the 
changing probabilities of RNA polymerase recruitment into ac- 
count does not change the mathematical form of the equations 
for the promoter activities (see Methods.) Thus, this additional 
complication does not affect our results. 

Dynamical behaviour of feedback loops 

We now turn to an analysis of differences in the temporal be- 
haviour of the feedback mechanisms. We model the dynamics by 
two coupled differential equations: 






E 



and 



ds 
dt 



—r — c— kEs. 



E and s represent the concentrations of the enzyme and signal 
molecule, respectively. The first term in the dE/dt equation is 
the rate of production of E which is equal to the promoter activ- 
ity, A (equations |3] and |9} ' . The second term represents degra- 
dation of E. The second equation describes the evolution of the 
concentration of s; it increases if there is a source, c > 0, of s (for 
instance from outside the cell) and decreases due to the action 
of the enzyme E. In the first equation both terms could be mul- 
tiplied by rate constants, representing the rates of transcription, 
translation and degradation. However, we have eliminated these 
constants by measuring time, t, in units of the degradation time 
of E, and by rescaling E appropriately (see the Methods section 
for details.) Thus, in these equations, E and t are dimensionless, 
with E lying between and 1 . k can then be interpreted as the 
maximum rate of degradation of s in units of the degradation rate 
ofE. 

Fig. 0t (left panel) shows what happens if the cell is subject 
to a sudden pulse of s. That is, the source c = always, but at 

' In using the steady state expressions for activities we are, in effect, assuming 
tliat the binding and dissociation of R to the operator and s to R occur on a much 
faster timescale than the transcription and translation of E. 








Figure 4: a. Left panel: time course for levels ofE and s when there is 
no source ofs(c — 0) and the system is subject to an instantaneous pulse 
lifting the value of s from zero to 10 x Kbb at time i = 0. The shaded 
region corresponds to s < Krs- Right panel: the time (after the pulse), 
T, required for each mechanism to reduce the value of s down to Krs, 
for different sizes of the initial pulse, b. Leftpanel: time course of levels 
ofE and s when a source ofs(c= 10 x Krb per degradation time ofE) 
is suddenly introduced at time t — 0. Right panel: Difference between 
the steady state values of s for the two mechanisms as a function of 
the value of the source term, c. In all plots the red, dashed line indicates 
mechanism (1) and the blue, solid line, mechanism (2). Parameter values 
are: Krq = 1,Krs = 100, /i = 2,h, = l,k = 10, and Krm = 1 
for mechanism (2). The active degradation rate k has been chosen to be 
much larger than the degradation rate ofE (which is unity). 



time t = the concentration of s abruptly jumps from zero to 
10 X Krs- This triggers an increase in the production ofE which 
then starts to decrease the concentration of s. There is no fur- 
ther addition of s to the system, so eventually all of it is removed 
and the system returns to its condition before the pulse. From 
the figure we see that, for the same parameter values, mechanism 
(2) results in a much faster removal of s because the response of 
E to the pulse is larger. The right panel adds further evidence 
to this conclusion. It shows, for both mechanisms, perturbed by 
varying sized pulses of s, the time taken for the concentration of 
s to fall to Kjis- This measure shows that mechanism (2) gen- 
erally responds faster than mechanism (1). The two mechanisms 
converge for small perturbations because there is no signal to re- 
spond to (levels of s are very low), and for very large perturba- 
tions because then the promoter becomes fully activated by the 
huge concentration of the inducing molecule s. 

Fig. 0J) shows what happens when the cell is subject to the 
appearance of a constant source of s. At time t = Q the value of 



c abruptly jumps from zero to 10 x Kjis per degradation time of 
E. In response, the production of E is increased and eventually 
reaches a new steady state value to deal with the constant influx 
of s (left panel). From the right panel of the figure it is evident 
that mechanism (2) is able to suppress the amount of s much more 
than mechanism (1) for most values of the rate of influx. Again, 
for similar reasons, the two mechanisms converge at small and 
large values of c. 

These observations apply for the case when KrsO — Kro 
for mechanism (2). The only effect on the dynamical equations 
caused by changing the ratio Knso/Kuo lies in the expression 
for A in the first term of the dE/dt equation. As mentioned in 
the previous section, changing this ratio mainly results in shifting 
of the response curve and as Krso is increased, A2 approaches 
Ai . For the dynamics this results in an increase in r (for a pulse) 
and in the steady state value of s (for a source) as Kjiso/Kro 
is increased. These values approach those for mechanism (1) in 
the limit Krso/Kro ^^ 00. The amount by which Kj^sO has 
to be boosted to effectively reduce mechanism (2) to mechanism 
(1) increases with increasing i?tot, as in the steady state case. 



DISCUSSION 



In the present paper we have discussed various strategies for 
negative feedback mechanisms involving the action of one signal 
molecule on a transcription factor In particular, we have inves- 
tigated two broadly different ways in which the signal molecule 
may change the action of the transcription factor: first, it could 
inhibit its action by sequestering it, and second, it could bind to 
the transcription factor while it is on the DNA site and there alter 
its action. The first mechanism occurs when the binding of the 
signal molecule reduces the affinity of the transcription factor to 
such an extent that it cannot subsequently remain bound to the 
DNA. This kind of inhibition of the transcription factor occurs in 
the lac system, where (allo)lactose reduces the binding affinity of 
Lad to the operator 01 by a factor of 1000 ( Il7t . This mechanism 
has also been exploited in synthetic gene networks (2]|5j. In the 
second mechanism the binding affinity is not altered that much; 
bet and mer belong to this category. In the case of mer the pres- 
ence of the signal molecule reverses the action of the transcription 
factor, changing it from a repressor to an activator J13> . 

In steady state, the two mechanisms differ most when the lev- 
els of the transcription factor are large enough to ensure substan- 
tial repression in the absence of the signal molecule. The under- 
lying reason for these differences is that, in this regime of full 
repression, for each transcription factor that binds to the signal 
molecule there is, for mechanism (1), an extra energy cost for 
the dissociation of the transcription factor from the DNA. The 
dynamical behaviour of feedback loops based on mechanisms 
(1) and (2) also differ substantially when promoters are in the 
fully repressed regime. We have shown that when the systems 
are perturbed by the sudden appearance of either a pulse or a 
source of signal molecules, mechanism (2) is generally faster and 
more efficient than mechanism (1) in suppressing the levels of the 
molecule. This prediction could be tested using synthetic gene 



circuits which implement these two mechanisms, for instance by 
extending the circuits built in ref. (0. In addition, this obser- 
vation fits neatly with the fact that the bet and mer systems use 
versions of mechanism (2), because they respond to harmful con- 
ditions (osmotic stress and the presence of mercury ions, respec- 
tively) and therefore need to respond quickly, while mechanism 

(1) is associated with lac, a system involved in metabolism of 
food molecules which therefore does not need to be as sensitive 
to the concentration of the signal molecules. In the case of lac it 
is probably energetically disadvantageous for the cell to respond 
to low levels of lactose sources (II8> . 

The differences between mechanisms are clear when they are 
compared keeping all parameters constant. In cells, however, pa- 
rameter values vary widely from one system to another which 
can obscure the differences caused by the two mechanisms. For 
instance, it is possible to increase the speed of response of mech- 
anism (1) by reducing the K^s value (i.e., increasing the binding 
strength between the regulator and the signal molecule.) Keep- 
ing all other parameters constant Kns needs to be decreased by 
a factor 10 for mechanism (1) to behave the same as mechanism 

(2) when _Rtot = 10 x Kro- This factor increases as _Rtot in- 
creases, i.e., as the repression is more complete. This can again 
be understood in terms of the extra energy cost for mechanism 
(1); increasing Krs sufficiently makes the extra energy cost in- 
significant compared to the R-s binding energy. Thus, a negative 
feedback loop in a real cell which needs to respond to signals on 
a given fast timescale could do so either by using mechanism (2), 
or by using mechanism (1) with a substantially larger R-s bind- 
ing affinity. For signal molecules where it is not possible for the 
R-s binding to be arbitrarily strengthened, mechanism (2) would 
be the better choice. On the other hand, mechanism (2) also has 
its disadvantages. For instance, at promoters with complex reg- 
ulation the DNA bound transcription factor using mechanism (2) 
may interfere with the action of other transcription factors. 

In Figure 1 we showed 4 examples, and have extensively dis- 
cussed example (i) and (ii). Another implementation of mecha- 
nism (2) is example (iii), with an activity A3 ~ (RsO) / [Ohcc + 
(RsO)]. In general this regulatory module is at least as efficient 
as mechanism (2), with a dynamical response which is even more 
efficient in the intermediate range of s (around Kbs)- The loop in 
Fig. [fliv) is, on the other hand, a different kind of negative feed- 
back from the other three examples. It involves synthesis of the 
signal molecule, and thus is aimed at maintaining a certain con- 
centration of the molecule, rather then minimising or consuming 
it. In practice, it is the kind of feedback that is common in biosyn- 
thesis pathways, where it helps maintain a certain level of amino 
acids, nucleotides, etc., inside the cell. 

The simple one-step, single-operator negative feedback loops 
investigated here clearly indicate that the mechanism of action of 
the signal molecule is a major determinant of the steady state and 
dynamical behaviour of the loop. Additional complexity in the 
mechanism of regulation (e.g., cooperative binding of a transcrip- 
tion factor to multiple binding sites) or of the regulatory region 
(competing transcription factors or multiple regulators respond- 
ing to different signals) JI9II20> would open up more avenues for 
the differences between the two mechanisms to manifest them- 
selves. 



These feedback loops form the link connecting the genetic and 
metabolic networks in cells. In fact, such loops involving signal 
molecules are likely to be a dominant mechanism of feedback 
regulation of transcription. Feedback using only regulatory pro- 
teins, without signal molecules, is probably too slow because it 
relies on transcription to change the levels of the proteins. Neg- 
ative auto-regulation can speed up the response of transcription 
regulation J21> . Nevertheless, feedback loops based on transla- 
tion regulation J22I i23l , active protein degradation J24I I25> or 
metabolism of signal molecules will certainly be able to oper- 
ate on much faster timescales. This is probably why feedback 
loops are rare in purely transciptional networks, which has con- 
tributed to the view that feed forward loops are dominant motifs 
in transcription regulation. Taking feedback loops involving sig- 
nal molecules into account alters this viewpoint substantially. In 
E. coli the number of feedforward loops in the transcription reg- 
ulatory network has been reported to be 40 QIIJ- Based on data 
in the EcoCyc database (i2ji, we know that there are more than 
40 negative feedback loops involving signal molecules where the 
regulation is by a transcription factor. Adding this many feed- 
back loops to the genetic network would also change the network 
topology substantially. In particular, it would diminish the dis- 
tinction between portions of the network that are downstream and 
upstream of a given protein. The effect of this would be to make 
the network more interconnected and reduce the modularity of 
the network by increasing the number of links between appar- 
ently separate modules. 



METHODS 

Promoter activity for mechanism (1) 



The operator can exist in one of two states: (i) free, Ofrcc- and 
(ii) bound to the regulator, {RO). If the concentration of free 
regulators is -R&oc then 



-Rfr, 



i?t 



1 + {s/KR,)hs 



and 



{RO) 



(Rtot/KBo) 



Ofr, 



l + is/KRs)''^ 
Using these and Otot = Oftcc + {RO), we get: 

Ofroc 1 



Ai 



Ou 



1 



l + is/Kii,)'-^ 



(4) 



(5) 



(6) 



Promoter activity for mechanism (2) 

The operator can exist in one of three states: (i) free, Ofroo, (ii) 
bound to the regulator, {RO), and (iii) bound to the regulator 
along with the signal molecule, {RsO). 

Again, with similar assumptions, we get equation |4] and [S] for 
i?frcc and {RO) plus an additional expression for {RsO): 



{RsO) 



{Rtot/KR,o){s/KR,y^ 
1 + {s/Krs)''^ 



Oftc 



(7) 



Using Otot ~ Ofroo + {RO) + {RsO) and equation|2lfor A2, 
we get: 



1 



A,- 



(-R'°VKfi.o)(s/-fCH.)''° 

l + (s/-fS"H=)'' = 









(8) 



General expression for promoter activity 

The most general expression for the activity, shown in equation 3, 
can also be rewritten using the expressions for {RO) and {RsO) 
calculated above: 



{RO) 



f Rhc 

[IT 



RO 



o 



free- 



Similarly, the concentration of regulators bound to signal 
molecules is 

{Rs) =i^] R 



lRs 



and the total concentration of regulators, a constant, is given by 

-Rtot = Rtrcc + {RO) + {Rs). 

We assume that the number of signal molecules is much larger 
than the number of regulators which, in turn, is much larger than 
the number of operator sites, i.e., s ^ i?tot ^ Otot- Then we 
can take s to be approximately constant and we can take {RO) ^ 
{Rs), giving: 



Rt, 



Rfree + 



K 



Rs 



RfT< 



A = 



a + P 



{R""/Kro) 
1+(s/Kb_,)'' = 



{R"'*/KRso){s/Kn,)''- 



l+(s//-fHs)"= 



iR*''yKa,o)is/KR,)'-^ 
1 + {s/Kr,)'^^ 



(9) 



Taking into account RNA polymerase recruitment 

A more correct, but more cumbersome, way to calculate the 
promoter activities is to explicitly take RNA polymerase into ac- 
count. Then, in the most general case, the system can be in one 
of 6 states: 

(1) R not bound to operator, RNAP not recruited: weight=l. 

(2) R not bound to operator, RNAP recruited: wt=piP. 

(3) R bound to operator, RNAP not recruited: wt={RO). 

(4) R bound to operator, RNAP recruited: v/t={RO)p2P- 



(5) R-s bound to operator, RNAP not recruited: wt=(i?sO). 

(6) R-s bound to operator, RNAP recruited: wt={RsO)p3P. 

Here pi,2,3 are the probabilities (per concentration) for recruit- 
ment of RNA polymerase in the three different states of the oper- 
ator, and P is the concentration of RNA polymerase. Taking the 
promoter activity to be when the polymerase is not recruited 
and a',/5',7' in states (2), (4) and (6), respectively, the activity 
can be written as follows: 



A 



{pia'+P2l3'{RO) +P3j'{RsO))P 
1 + Pi + (1 + P2)iRO) + (1 + P3)iRsO) ■ 



By absorbing the constants pi.2,3 into Kj^s, Kro and Kjiso, we 
recover equation|9] 

Rescaling of the dynamical equations 



With all rate constants included, the dynamical equations for 
the time evolution of the concentrations of E and s can be written 
as follows: 

= kiA{Rtot,s)-jE, 



dt 



dt 



= c! - k'Es. 



Now measuring time in units of the degradation time of E: t' = 
7t, and transforming E using E' = E{j/ki), we get 

dE' 

— ^AiRtot,s)-E' 

ds c' k'ki I 
dt' 7 7^ 

which, with c = c'/7 and k = k'kij^"^, are the equations used in 
the main text. 
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